CHAPTER SEVEN

Case #13: FORECASTING THE FEDERAL BUGDET DEFICIT ONCE AGAIN

Goal: This case introduces Box-Jenkins modeling in the context of forecasting the federal budget deficit for the period 1997M1-1997M4 as we did in Chapter Six, Case #1. Specifically, this case examines

Recall that in Chapter Six, Case #1 you produced forecasts of the monthly federal budget surplus/deficit for the period 1997M1-1997M4 using linear-trend time-series decomposition. Recall that your forecasts for 1997M1-1997M4 and actual values were:

 

Actual BUDGET

Forecast BUDGET

1997M1

13.36

14.66

1997M2

-44.01

-45.10

1997M3

-21.32

-49.68

1997M4

93.93

44.57

From this Table we calculated RMSE to be 28.48, i.e., the average error expected each month is approximately 28 billion dollars using linear trend decomposition. In this case we see if we can improve upon these forecasts applying Box-Jenkins models.

Problem Spreadsheet

The spreadsheet for this problem is C7_Case1.xls. It contains the following data:

Variable

Data Range

BUDGET

1990M1-1996M12

The series BUDGET is the non-seasonally adjusted monthly federal budget surplus (deficit). The series is the "official" federal budget numbers from the Congressional Budget Office.

Examining Data for Stationarity

To examine the behavior of monthly deficit data (BUDGET) over the historical period (1990M1-1996M12), we generated a time-series plot of the data using Excel.

Question #1: Viewing the time-series plot of the data, what behavior do you need to model?

ANSWER:

ARIMA Model Identification

The following ACF and PACF for BUDGET were generated using FORECASTXTM.

Question #2: What is a candidate ARMA-type model for generating monthly forecasts?

ANSWER:

Using the ACF and PACF estimates to identify a "candidate" ARMA model, remember our ARMA identification rules of thumb: (1) If the autocorrelation function abruptly stops at some point -- say, after q spikes -- then the appropriate model is an MA(q) model; (2) If the partial autocorrelation function abruptly stops at some point -- say, after p spikes -- then the appropriate model is an AR(p) type; (3) If neither function falls off abruptly, but both decline toward zero in some fashion, the appropriate model is an ARMA(p, q). By reference to the ACF and PACF correlograms, we see the data appears to have MA components at lags 1, 3, 11, and 12, whereas the AR components arise at lags 1,2, 5, 9, 11, and 12. This suggests up to an ARMA(12,12) model to generate forecasts into the holdout period and forecast horizon.

Estimating an ARMA Model

We can estimate an ARMA model in FORECASTXTM by simply editing the parameters that appear once you have selected Box Jenkins as the forecast method. Using FORECASTXTM we find that the highest order lags allowed are nine for the MA and AR terms. Accordingly, we estimated an ARIMA(9,0,9)*(0,0,0) model, which is an ARMA(9,9) model in the notation of the text, as a first pass at the data and generated the following reports. Note this requires we set all differencing and seasonal parameters to zero in the edit parameters box. Accordingly, our ARIMA(9,0,9)*(0,0,0) model has no data transformation aimed at eliminating trend and seasonality from the data.

Estimation results for the 1996 holdout period and the 1997 forecast period are shown below.

Forecast -- Box Jenkins Selected

 

 

 

 

Forecast

 

95% - 5%

95% - 5%

Date

Monthly

Quarterly

Annual

Upper

Lower

Jan-1997

17.16

38.20

-3.88

Feb-1997

-50.90

-21.15

-80.66

Mar-1997

-25.68

-59.42

10.76

-62.12

Apr-1997

28.57

70.65

-13.51

May-1997

-44.84

2.20

-91.89

Jun-1997

12.37

-3.90

63.91

-39.17

Jul-1997

-33.79

21.88

-89.46

Aug-1997

-28.62

30.89

-88.13

Sep-1997

39.47

-22.94

102.59

-23.65

Oct-1997

-46.79

19.74

-113.33

Nov-1997

-35.19

34.59

-104.97

Dec-1997

4.46

-77.52

-163.79

77.35

-68.42

Avg

-13.65

-40.95

-163.79

37.63

-64.93

Max

39.47

-3.90

-163.79

102.59

-3.88

Min

-50.90

-77.52

-163.79

-21.15

-113.33

The following accuracy statistics relate to the holdout period of 1996.

Accuracy Measures

 

Value

AIC

705.89

BIC

752.08

Mean Absolute Percentage Error (MAPE)

66.78%

Sum Squared Error (SSE)

13,961.51

R-Square

80.18%

Adjusted R-Square

74.69%

Root Mean Square Error

12.89

The following are the estimation results of the ARIMA(9,0,9)*(0,0,0) model.

Method Statistics

 

Value

Method Selected

Box Jenkins

Model Selected

ARIMA(9,0,9) * (0,0,0)

T-Test For Non Seasonal AR

-0.70

T-Test For Non Seasonal AR

-0.55

T-Test For Non Seasonal AR

0.22

T-Test For Non Seasonal AR

0.16

T-Test For Non Seasonal AR

0.63

T-Test For Non Seasonal AR

-0.15

T-Test For Non Seasonal AR

0.21

T-Test For Non Seasonal AR

0.00

T-Test For Non Seasonal AR

1.15

T-Test For Constant

-0.72

T-Test For Non Seasonal MA

-0.25

T-Test For Non Seasonal MA

-0.78

T-Test For Non Seasonal MA

0.21

T-Test For Non Seasonal MA

-0.07

T-Test For Non Seasonal MA

0.28

T-Test For Non Seasonal MA

-0.55

T-Test For Non Seasonal MA

0.19

T-Test For Non Seasonal MA

-0.19

T-Test For Non Seasonal MA

0.96

Finally, to see how well the ARIMA(9,0,9)*(0,0,0) model performs, we generated a correlogram of the estimated residuals as reported below.

Question #3: By examining the summary statistics above and the correlogram of the estimated residuals, how well does the ARIMA(9,0,9)*(0,0,0) model fit the data?

ANSWER:

In addition, the ACF function has significant autocorrelation at lags 3, 6, and 12. Similarly, the PACF has a spike at lag 3, and is marginally significant at lag 12. This suggests the residuals are not white noise and the ARIMA(9,0,9)*(0,0,0) model is suspect. The problem appears to be seasonality in BUDGET data, which can easily be handled in FORECASTXTM using its seasonal differencing capabilities.

Estimating an ARIMA Model with Seasonal Adjustment

Using FORECASTXTM we re-estimated the model using the optimal parameter selection option, which automatically determines the degree of data level and seasonal differencing. Here simply leave all parameters blank in the edit parameters box and FORECASTXTM will select the optimal parameters.

The Expert Selection option in FORECASTXTM selected an ARIMA(1,0,1)*(0,1,0) model as the best. This model employs first-degree seasonal differencing with first-order non-seasonal MA and AR terms. Estimation results for the 1996 holdout period and the 1997 forecast period, as well as summary reports are shown below.

Forecast -- Box Jenkins Selected

 

 

 

 

Forecast

 

95% - 5%

95% - 5%

Date

Monthly

Quarterly

Annual

Upper

Lower

Jan-1997

14.58

32.78

-3.63

Feb-1997

-42.52

-16.78

-68.27

Mar-1997

-47.75

-75.70

-16.22

-79.29

Apr-1997

72.66

109.07

36.25

May-1997

-53.15

-12.44

-93.85

Jun-1997

34.38

53.89

78.97

-10.22

Jul-1997

-26.87

21.30

-75.04

Aug-1997

-41.83

9.67

-93.32

Sep-1997

35.26

-33.44

89.87

-19.36

Oct-1997

-39.81

17.76

-97.38

Nov-1997

-37.88

22.50

-98.26

Dec-1997

18.82

-58.87

-114.12

81.89

-44.24

Avg

-9.51

-28.53

-114.12

34.86

-53.88

Max

72.66

53.89

-114.12

109.07

36.25

Min

-53.15

-75.70

-114.12

-16.78

-98.26

The following estimation results of the ARIMA(1,0,1)*(0,1,0) model were obtained from FORECASTXTM.

Method Statistics

 

Value

Method Selected

Box Jenkins

Model Selected

ARIMA(1,0,1) * (0,1,0)

T-Test For Non Seasonal AR

-1.21

T-Test For Non Seasonal MA

-0.04

Note how FORECASTXTM handles seasonality by first-differencing the data and then applying non-seasonal AR and MA terms.

The following accuracy statistics relate to the holdout period of 1996.

Accuracy Measures

 

Value

AIC

661.02

BIC

665.88

Mean Absolute Percentage Error (MAPE)

54.93%

Sum Squared Error (SSE)

12,266.52

R-Square

82.58%

Adjusted R-Square

82.37%

Root Mean Square Error

12.08

To see how well the ARIMA(1,0,1)*(0,1,0) model fit the data, we generated a correlogram of the estimated residuals:

Question #4: By examining the summary statistics above and the correlogram of the estimated residuals, how well does the ARIMA(1,0,1)*(0,1,0) model fit the data?

ANSWER:

Questions #5: Report your ARIMA forecasts for the forecast period 1997M1-1997M4, and compare the accuracy with the results from Chapter Six, Case #1.

ANSWER:

Student Practice Question

Question #1: Try re-estimating this case with updated BUDGET data into 2001. Contrast and compare your results with those of this case. Explain!